Clonality and Diversity in the Soft Rot Dickeya solani Phytopathogen

Bacterial diversity analyses often suffer from a bias due to sampling only from a limited number of hosts or narrow geographic locations. This was the case for the phytopathogenic species Dickeya solani, whose members were mainly isolated from a few hosts–potato and ornamentals–and from the same geographical area–Europe and Israel, which are connected by seed trade. Most D. solani members were clonal with the notable exception of the potato isolate RNS05.1.2A and two related strains that are clearly distinct from other D. solani genomes. To investigate if D. solani genomic diversity might be broadened by analysis of strains isolated from other environments, we analysed new strains isolated from ornamentals and from river water as well as strain CFBP 5647 isolated from tomato in the Caribbean island Guadeloupe. While water strains were clonal to RNS05.1.2A, the Caribbean tomato strain formed a third clade. The genomes of the three clades are highly syntenic; they shared almost 3900 protein families, and clade-specific genes were mainly included in genomic islands of extrachromosomal origin. Our study thus revealed both broader D. solani diversity with the characterisation of a third clade isolated in Latin America and a very high genomic conservation between clade members.


Introduction
Different forces including spreading via worldwide trade routes, dissemination to different hosts or survival in various environments drive structures of plant pathogen populations.This results in various degrees of genomic diversity from one bacterial species to another.The genus Dickeya is one example of such variations in diversity depending on the species [1,2].
Dickeya members are collectively responsible for soft rot, blackleg, stem rots, stunting, cankers and wilting on a large number of plant hosts, both monocots and dicots, including mostly vegetables and ornamentals but also trees [3].Their most important virulence factor is the production and secretion of a battery of plant cell-wall-degrading enzymes that provoke the maceration of plant tissues, leading to cell lysis and release of nutrients.However, virulence also relies on several other factors that allow these bacteria to adapt to environmental changes encountered in planta and to face the stresses produced by plant defence responses [4].
The Dickeya genus consists of 13 species presenting differences in environmental habitats, plant host range and genomic diversity [1,3,[5][6][7].Among the species with a sufficient number of genomes available for diversity studies, several like D. fanghzongdai, D. dadantii, D. chrysanthemi and the D. zeae complex harbour a high genomic diversity with some strains being at the limit of species definition [1,2,8].The previously described D. zeae species was even recently split into three species-D.zeae, D. oryzae and D. parazeae [2,6,7], but some strains of the two first species are still highly divergent from the type strains [2].
By contrast, D. dianthicola and D. solani members have been reported as genomically closely related [1,[9][10][11][12][13][14].Both species were responsible for important outbreaks in the same host potato, and consequently, most available genomes belong to isolates from this host plant.This introduced a bias due to the analysis mainly of strains isolated from only one host and only from Europe and the US.A recent study analysing D. dianthicola genomic diversity using a larger strain panel encompassing more strains isolated from potato over a wider time scale and more strains isolated from hosts other than potato revealed a higher diversity [15].
The host range of D. solani is quite narrow.This species was first described following an important outbreak ravaging potato in Europe in the 2000s [11].Besides potato, it has been isolated only from two ornamentals, hyacinth and muscari [3].Even if these strains have been isolated over a long time period [16], most of them are clonal, differing only from a few dozen to a few hundred SNPs [12][13][14][15][16][17].The potato isolate RNS05.1.2A,however, is clearly distinct from other D. solani genomes, harbouring more than 30,000 SNPs with them [16].Recently, two other strains isolated either from potato or from river water were shown to belong to this RNS05.1.2Aclade [14].A few genomes harbour medium diversity due to replacing horizontal gene transfers from D. dianthicola or RNS05.1.2Ain the main core strain genome [16].Concerning the accessory genomes, genomic variations in D. solani consist mainly in the presence of genes related to phages or even complete prophages [14,18].Despite their very high genetic homogeneity, D. solani strains may vary considerably in their virulence levels (aggressiveness) as well as in their production of virulence factors and motility [18].Analysis of D. solani biodiversity at the proteome level using matrix-assisted methods has unveiled intraspecies variations among 20 D. solani strains differing in their virulence-associated features in addition to the country of origin and year of isolation.These strains were grouped into four delineated clades and differed also in growth rate, plant tissue macerating potential and protease activities, in contrast to the high uniformity among the metabolic profiles, which diverged only in terms of gelatinase activity.However, a definitive link between pathogenicity and the recorded MALDI-TOF MS spectra could not be established [19].
In this work, we investigated if D. solani genomic diversity might be broadened through analysis of strains isolated from other environments: ornamentals, river water and a new host.

Collection of New D. solani Strains Isolated in Different Environments
To broaden the sources of D. solani strains analysed for genomic diversity, we searched for strains isolated in environments other than potato (Table 1).
As the genomes of only two D. solani strains isolated from ornamentals have been analysed so far, we analysed two other strains isolated from hyacinth in the Netherlands: PO2019 and PO3796 (courtesy of Jan van der Wolf).We also took advantage of the two-year survey of soft rot Pectobacteriaceae we recently conducted in the Durance catchment, a river from the south of France [20] to chase D. solani isolates from river water.Four isolates were retrieved: FVG2-MFV017-A9 isolated from the Canal de Crillon, an irrigation canal fed by the river Durance, FVG9-S3-A17-E1 isolated from the Durance River itself and FVG13-S21A17-D9 and FVG14-S21A17-C8, both from a tributary of the Durance, the Grand Anguillon river (see Material and Methods for isolation procedure).New D. solani isolates genomically divergent from the type strain were also searched in the French Collection for Plant-associated Bacteria, CIRM-CFBP (https://cirm-cfbp.fr/;accesses on 12 December 2023), by using multilocus sequence analysis (MLSA) performed on the basis of recA, leuS and dnaX housekeeping genes.This allowed us to identify strain CFBP 5647 (isolated in Guadeloupe from tomato) that was mapped in a branch of the MLSA tree distinct both from the type strain and from RNS05.1.2A(Figure S1).The genomic characteristics of these seven strains are presented in Table 1.Furthermore, the complete sequence of CFBP 5647 was obtained using a hybrid assembly of Illumina short reads and ONT long reads.It consists of a single circular chromosome totalling 5,248,586 bp with a GC content of 56.2%.Annotation of this genome using the Rapid Annotation using Subsystems Technology (RAST) generated a total of 4978 predicted genes, including 4883 protein coding sequences, 74 tRNAs and 21 rRNAs organised in seven operons.To analyse the D. solani diversity, we compared our seven new genomes with the genomes of the three D. solani strains of the minor subgroup exemplified by strain RNS05.1.2A[14,16] and seven members of the previously defined main subgroup here defined as the "core" clade.Out of the 40 "core" genomes present in the NCBI database, we chose the genomes of the type strain IPO2222, the two strains isolated from ornamentals PPO9019 and PPO9134, the strain RNS07.7.3B that harbours horizontal genetic transfers from strain RNS05.1.2A[16], the well-studied strains RNS08.23.3.1.A and Ds0432-1 [16,[21][22][23][24] and the strain D12 from Russia, far from the Western European seed trade network.
A whole genome multilocus sequence analysis (MLSA) phylogenomic tree built up from concatenated sequences of 3403 core proteins (Figure 1) showed that all strains isolated from French watersheds belong to the minor clade related to RNS05.1.2A,while the two additional strains isolated from ornamentals belong to the core clade.Strains within each of these clades appeared highly related.The Caribbean strain CFBP 5647 isolated from tomato forms a third "clade".
Calculation of average nucleotide identity (ANI) values between all genomes of our strain panel confirmed the high relatedness between strains of the same clade.Indeed, the genomes of both minor and core clades shared 99.9-100% ANI between members of the same clade.The tomato strain CFBP 5647 exhibited 98.5-98.7%ANI with the other strains of our panel, a value that is in the range of ANI values between each member of the core clade compared to members of the minor RNS05.1.2Aclade (98.7-98.8%)(Supplementary Table S1).Calculation of average nucleotide identity (ANI) values between all genomes of our strain panel confirmed the high relatedness between strains of the same clade.Indeed, the genomes of both minor and core clades shared 99.9-100% ANI between members of the same clade.The tomato strain CFBP 5647 exhibited 98.5-98.7%ANI with the other strains of our panel, a value that is in the range of ANI values between each member of the core clade compared to members of the minor RNS05.1.2Aclade (98.7-98.8%)(Supplementary Table S1).
The relationships between the different D. solani strains were further addressed by comparing the protein-coding sequences of the 17 genomes using the SiLix gene family clustering tool.Proteins were classified as homologous to others in a given family if the amino acid identities were above 70%, with 80% minimal overlap.All D. solani strains shared 3873 protein families representing 79% to 86% of the genome content.For the core and RNS05.1.2Aclades, as many as 4216 and 4456 protein families are common, representing 91-94% and 92-95% of the genome content, respectively.Reciprocally, members of the RNS05.1.2Aclade only exhibited 13 to 38 specific protein families.Members of the core clade displayed 5 to 125 specific protein families, with the highest content displayed by the strains isolated from ornamentals (25 to 125 specific protein families).For its part, the CFBP 5647 strain displayed 456 specific protein families (see below).

Relatedness of D. solani Strains by SNP Analysis
To gain further insight into D. solani relatedness, we constructed a Minimum Spanning Tree (MST) based on the whole set of single nucleotide polymorphisms (SNPs) present in the core genome identified in our strain panel (Figure 2).The relationships between the different D. solani strains were further addressed by comparing the protein-coding sequences of the 17 genomes using the SiLix gene family clustering tool.Proteins were classified as homologous to others in a given family if the amino acid identities were above 70%, with 80% minimal overlap.All D. solani strains shared 3873 protein families representing 79% to 86% of the genome content.For the core and RNS05.1.2Aclades, as many as 4216 and 4456 protein families are common, representing 91-94% and 92-95% of the genome content, respectively.Reciprocally, members of the RNS05.1.2Aclade only exhibited 13 to 38 specific protein families.Members of the core clade displayed 5 to 125 specific protein families, with the highest content displayed by the strains isolated from ornamentals (25 to 125 specific protein families).For its part, the CFBP 5647 strain displayed 456 specific protein families (see below).

Relatedness of D. solani Strains by SNP Analysis
To gain further insight into D. solani relatedness, we constructed a Minimum Spanning Tree (MST) based on the whole set of single nucleotide polymorphisms (SNPs) present in the core genome identified in our strain panel (Figure 2).
As previously reported [12][13][14]16,17], members of the core clade only differ from each other by a few SNPs, except the potato strain RNS07.7.3B and the two ornamental strains PP09019 and PPO9134 that suffered gene replacement by horizontal gene transfer and homologous recombination with RNS05.1.2Aand D. dianthicola, respectively.Notably, the two new genomes of strains isolated from ornamentals analysed in this study do not present such replacement and differ from the type strain IPO222 only by very few SNPs (Figure 2).Such clonality was also observed between members of the RNS05.1.2Aclade.The five strains isolated from the Durance catchment only harbour 2 to 4 SNPS between each other.The two strains isolated from potato were a bit more distinct, harbouring 17 and 42 SNPs with the closest strain isolated from water.As previously reported [12][13][14]16,17], members of the core clade only differ from each other by a few SNPs, except the potato strain RNS07.7.3B and the two ornamental strains PP09019 and PPO9134 that suffered gene replacement by horizontal gene transfer and homologous recombination with RNS05.1.2Aand D. dianthicola, respectively.Notably, the two new genomes of strains isolated from ornamentals analysed in this study do not present such replacement and differ from the type strain IPO222 only by very few SNPs (Figure 2).Such clonality was also observed between members of the RNS05.1.2Aclade.The five strains isolated from the Durance catchment only harbour 2 to 4 SNPS between each other.The two strains isolated from potato were a bit more distinct, harbouring 17 and 42 SNPs with the closest strain isolated from water.

Genomic Synteny
We compared the genomic architecture of the tomato strain CFBP 5647 with the type strain IPO2222 and the reference strain RNS05.1.2Ausing the MAUVE whole-genome alignment (Figure 3).The three genomes are highly syntenic with only one large inversion in CFBP 5647 as compared to the two other genomes.The alignment also highlighted specific regions that contain several genes related to phages and mobile genetic elements.This prompted us to predict integrated prophages based on PHASTER classification.CFBP 5647 harbours three complete prophages from which one is highly related to the Mutator phage Mu (P3, Table 2).RNS05.1.2Acarries five complete prophages from which two are related to the CFBP 5647 prophage 1 (P2 and P3).RNS05.1.2Aprophage P1 is included in a roughly 200,000 Kb complex repeat region (marked by a star in Figure 3) that gathers several genes linked to phages, mobile elements and methyl-accepting chemotaxix proteins (MCPs).No complete prophage was detected in IPO2222.As there were still other uncharacterised specific regions in the MAUVE alignment, we used another approach to search for clade-specific genes by using the SiLix gene family clustering tool.We compared the genomic architecture of the tomato strain CFBP 5647 with the type strain IPO2222 and the reference strain RNS05.1.2Ausing the MAUVE whole-genome alignment (Figure 3).The three genomes are highly syntenic with only one large inversion in CFBP 5647 as compared to the two other genomes.The alignment also highlighted specific regions that contain several genes related to phages and mobile genetic elements.This prompted us to predict integrated prophages based on PHASTER classification.CFBP 5647 harbours three complete prophages from which one is highly related to the Mutator phage Mu (P3, Table 2).RNS05.1.2Acarries five complete prophages from which two are related to the CFBP 5647 prophage 1 (P2 and P3).RNS05.1.2Aprophage P1 is included in a roughly 200,000 Kb complex repeat region (marked by a star in Figure 3) that gathers several genes linked to phages, mobile elements and methyl-accepting chemotaxix proteins (MCPs).No complete prophage was detected in IPO2222.As there were still other uncharacterised specific regions in the MAUVE alignment, we used another approach to search for cladespecific genes by using the SiLix gene family clustering tool.

Analysis of Clade-Specific Genes
The SiLix analysis revealed that 456, 137 and 221 protein families are present only in CFBP 5647, the core clade and the RNS05.1.2Aclade, respectively.More than half of the CFBP 5647-specific protein families are clustered in four genomic regions (Table 3).Two of them showed similarities with ICEs (Integrative Conjugative Element).GR3 groups 65 protein families from which ten are related to plasmid replication, recombination and segregation functions of the Pseudomona fluorescens mobile genomic island 01 (PFGI-1) [25].Even if a homolog of VirD4, an ATPase involved in T-DNA transfer, is present, GR3 does not carry a Pil or T4SS-like operon necessary for the biogenesis of a pilus allowing conjugative transfer.In addition, GR3 carries mainly genes encoding hypothetical proteins and some proteins involved in metabolism.GR5 presents characteristics of a Tn4371-related ICE that is partially present in the model D. dadantii strain 3937 [23,26].This ICE regroups a conjugative Trb transfer system as well as VirD2 and VirD4 associated homologs, a plasmid-replication machinery, a DNA restriction system, an efflux transport system and some proteins putatively involved in metabolism including NRPS and PKS.GR7 regroups protein families involved in a second T4SS protein secretion system and an efflux system, while GR8 regroups a complete Mu prophage.Notably, CFBP 5647 possesses a CRISPR cluster (GR6), while no such systems have been described previously in D. solani.All CFBP 5647 genomic regions present signs of extrachromosomal origin (Table 3).Members of the core clade only harbour five short genomic regions (5 to 15 protein families) consisting of extrachromosomal elements, hypothetical proteins or proteins involved in metabolism.This clade is the only one whose members possess the cluster of genes encoding endotoxins of the Bacillus thurigiensis-related Cyt family active on insects that have been characterised in D. dadantii (GR3) [27].RNS05.1.2Aprophages 3 and 4 are missing in RNS10-105-1A and A623-S20-A17, respectively, and thus do not appear in clade-specific genes.

Virulence of New Isolates
As even highly closely related D. solani strains might vary greatly in their aggressiveness (here defined as the quantitative production of symptoms in contrast to virulence defined as the qualitative ability of producing symptoms) [18], strains isolated from water in this study and the Caribbean tomato strain CFBP 5647 were tested for aggressiveness both on potato tubers and on detached chicory leaves (Figure 4).These strains were able to cause symptoms on both plants.Like the potato strain RNS05.1.2A,these strains were even more aggressive than the type strain IPO222 on chicory leaves (student test p < 0.05) (Figure 4B).

Discussion
Previous studies of D. solani genomic diversity [1,[12][13][14]16,17] revealed the clonality of this bacterial species with the notable exception of strain RNS05.1.2Aand two recently described related strains [14].The analysed strains were, however, mainly isolated from the same host-potato-and from the same geographical area-Europe and Israel-via seed trade [28].This points to the spread of a seminal clone that invaded the continent by seed trade dissemination.That is why in this work we extended D. solani genomic analysis to strains isolated from more diverse sources.The newly studied strains belong to three genomic clades exhibiting around 98% ANI values between each other: the core clade including most potato strains and two strains isolated from ornamentals, the minor clade with as reference strain RNS05.1.2Aand a third "clade" defined by the strain CFBP 5647 isolated from tomato in Guadeloupe.
The two additional strains isolated from ornamentals belong to the core clade.They do not harbour signs of replacing horizontal gene transfer as described in the two previous strains analysed [16] and carry only a few SNPs as compared to other members of this clade.This reinforced the commonly accepted hypothesis that D. solani emerged in potato following transfer from ornamentals [29,30].The four strains isolated from surface waters all belong to the clade exemplified by strain RNS05.1.2A.They harbour extremely few SNPs between each other and the other already analysed strain also isolated from the river Durance watershed [31].As three of these strains have been isolated from the river Durance or canals fed by this river, we cannot exclude spreading of a single clone via the river.However, strains FVG13-S21A17-D9 and FVG14-S21A17-C8 have been isolated from a tributary of the Durance, implying at least two introduction sources in river waters.The two other members of this clade-RNS05.1.2Aand RNS10-105-1A-both isolated from potato in the north of France, exhibit a few dozen SNPs with water strains and between each other.We may question if related strains are present in waterways in the north of France and might be a source of contamination explaining the occurrence of this rare clade in potato.
To tackle this, it would be interesting to analyse the D. solani strains present in waterways in this region.
So, our study revealed both broader D. solani diversity with the characterisation of a third "clade" isolated in Latin America and a very high genomic conservation between clade members.In the Dickeya genus, diversity varies from one species to the other, with species like D. zeae and D. fanghzongdai encompassing strains so diverse that they are at the limit of species appurtenance, while other species are much more related [2].In particular, the species D. dianthicola, another strain particularly studied in relation to potato outbreaks, shows both a high diversity and a high relatedness of strains isolated over a large scale of time and geographical origin [15].As for D. solani, this highlights the high genomic stability for some members of this species.Similarly, in Pectobacterium brasiliense, Jonkheer et al. [32] reported such a high relatedness of several strains within a clade of strains isolated from different locations, besides a clade of more diverse strains.In this organism, clonality was related to a high aggressiveness.We have no indications of such a relationship in D. solani.Indeed, Golanovska et al. [18] showed that members of the core clade may vary greatly in their aggressiveness and, at least on detached plant organs, we did not observe differences in virulence between the three D. solani clades (Figure 4).
The previously reported D. solani core genome is very large, encompassing almost 4000 genes [1].Adjunction of the more diverse Caribbean tomato strain CFBP 5647 did not reduce significantly this core genome, since out of the 3956 protein families present both in the core and RNS05.1.2Aclades analysed in this study, only 81 are missing in CFBP 5647.As in other Dickeya species, the repertoire of virulence-associated genes is conserved in D. solani with only the presence of a split pelN gene in CFBP 5647.
Such a large core genome is characteristic of many Dickeya species in which the members share around 3500 protein families [1,2].Notable exceptions are D. chrysanthemi [1], D. oryzae [7] and D. dianthicola, for which analysis of more diverse strains [15] revealed a core genome of only around 3000 protein families.This is, however, still very large if compared to, for example, E. coli, for which, using less stringent conditions (50% identity on 50% of the length of the proteins), the core genome was estimated to comprise only about 1500 orthologous genes [33].
Conversely, each of the three D. solani clades only harbouwred one to a few hundred specific genes.Many of them are clustered in genomic regions that often present signs of horizontal gene transfers or are associated with extrachromosomal elements as already reported for the core clade [12].
In conclusion, our analysis of D. solani showed that, besides the clonal spreading of ornamental and potato strains through Europe and Israel, another clade of clonal strains is observed, only in France until now, both in potato and in surface water.Further diversity was observed in a strain isolated from tomato in a Caribbean island.As already noticed for the related species D. dianthicola [15], it is thus critical in bacteria diversity studies to broaden the sources of analysed strains by varying both geographic locations and environments/hosts.The recent reports of D. solani occurrences in Latin America on diverse hosts [28,34,35] will clearly help to further characterise the diversity of this species.

dnaX-leuS-recA Phylogeny of Dickeya Solani Strains from CIRM-CFBP
Portions of genes dnaX, leuS and recA were sequenced following the protocol described for Pectobacterium isolates [36].The sequences obtained for this study were manually checked, aligned and concatenated to reconstruct a phylogenetic tree (Figure S1).This allowed the identification of the distantly related strain CFBP 5647.

DNA Extraction, Genome Sequencing and Assembly
The bacterial genomes used in this study are presented in Table 1.Strains originating from water were isolated by filtering of 500 mL of surface water using 0.22 µm filters.The bacteria retained on the filters were suspended in water and serially diluted onto crystal violet pectate (CVP) medium, a semi-selective medium containing pectin that is widely used for the isolation of pectinolytic bacteria of the genera Pectobacterium and Dickeya [37].Colonies forming pits on CVP plates were grown overnight in liquid medium (LB without NaCl), and qPCR amplifications were performed out of bacterial cell lysates with primers pelD1118d-F (VRC BTA CAA ACC SAC TCT G) and pelD1200d1-R (TGC GTT GYT RTT GAT GCT G), derived from the sequence of the gene pelD that is specific to the genus Dickeya.The Dickeya candidates were further purified on CVP plates and then on LB-plates (LB medium without added NaCl).
Total bacterial DNA was extracted from pure bacterial cultures using the Wizard genomic DNA purification kit (Promega) following the manufacturer's protocol.
DNA quality checks (using the Qubit HS kit (ThermoFisher, Waltham, MA, USA) and visual quality checks on agarose gels) and genome sequencing were performed at the next-generation sequencing core facilities of the Institute for Integrative Biology of the Cell (Avenue de la Terrasse 91190 Gif-sur-Yvette France).Nextera DNA libraries were prepared from 50 ng of high-quality genomic DNA.Paired-end 2 × 75 pb sequencing was performed on an Illumina NextSeq500 apparatus with a High Output 150 cycle kit.CLC Genomics Workbench (Version 9.5.2,Qiagen Bioinformatics, Venlo, The Netherlands) was used to assemble reads.Final sequencing coverage was between 66 and 197.Genomic data for the new genome assemblies are presented in Table S2.Coding sequences were predicted and annotated using the Rapid Annotation using Subsystems Technology (RAST) server [38] with the Glimmer 3 prediction tool [39].For the sake of comparison, genomes extracted from NCBI were imported in the RAST pipeline and re-annotated using Glimmer 3.
The complete genome of the CFBP 5647 strain was obtained using a hybrid assembly of Illumina short reads and ONT long reads.Long-read sequencing was performed on the genotoul platform (https://www.genotoul.fr;accessed 12 December 2023) with Oxford Nanopore GridION technology.The hybrid assembly of reads from the two sequencing strategies was performed using Unicycler v0.4.9 with default parameters [40].

Genome Analysis
For all programs described in the following sections, default parameters were used.Pairwise comparison of the genomes was computed using the average nucleotide identity (ANI) in the Pyani python module v0.2.10 (https://github.com/widdowquinn/pyani; accessed 12 December 2023) [41] with the BLAST algorithm (ANIb).The species threshold was set at 96%.
Orthologous sequences were clustered into homologous families using the SiLix software package v1.2.9 [42] with a 70% identity threshold and at least 80% overlap.Strainspecific and clade-specific gene families and gene families absent in only one of the analysed genomes were extracted from the SiLix output.For the construction of MLSA trees, common genes defined as genes present in all strains meeting the criteria of 70% identity threshold and at least 80% overlap were aligned using MUSCLE [43] software v5.1 and were filtered using the GBLOCK tool [44].The alignments were used for building a phylogenetic tree with the BioNJ algorithm with SeaView software v5.0.5 [45], with 200 bootstrap replications.

Minimum Spanning Tree Analysis
SNPs were extracted from the aligned coding sequences of the core genome.To represent the possible evolutionary relationships between strains (minimum spanning tree), we used the online version of the software PHYLOViZ v2.0 [46].

Virulence Assays
Bacterial strains were plated on LB − (consisting of LB medium without added NaCl (10 g/L peptone, 5 g/L yeast extract)) plates, incubated for 16 h at 28 • C and re-suspended in KPO 4 50 mM pH 7.0 buffer.After wounding with a yellow tip, eight potato tubers and ten detached chicory leaves for each bacterial strain were inoculated with 2 × 10 6 bacteria and incubated at 26 • C in closed boxes to allow high humidity.After five days of incubation, symptoms on potato tubers were categorised according to the following scale: 1: maceration zone < 2 mm; 2: maceration zone < 5 mm; 3: maceration zone < 10 mm; and 4: maceration zone > 10 mm.For chicory leaves, the length of rotted tissue after 24 h incubation was measured to assess disease severity.Both assays were carried out in triplicate, and the three sets of data were pooled.
Funding: This research was supported by the project SPREE (ANR-17-CE32-0004-04) financed by the French National Agency for Research.

Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.

Figure 1 .
Figure 1.Whole genome MLSA phylogenic tree.Phylogenic tree built up from the concatenated sequences of 3403 homologous protein sequences (197,918 variable sites).One hundred bootstrap replicates were performed to assess the statistical support of each node.

Figure 1 .
Figure 1.Whole genome MLSA phylogenic tree.Phylogenic tree built up from the concatenated sequences of 3403 homologous protein sequences (197,918 variable sites).One hundred bootstrap replicates were performed to assess the statistical support of each node.

Figure 2 .
Figure 2. Minimum spanning tree-based genome SNP analysis.The tree is based upon 60,195 SNPs.The length of each branch (log scale) expressed in SNP numbers is indicated.

Figure 2 .
Figure 2. Minimum spanning tree-based genome SNP analysis.The tree is based upon 60,195 SNPs.The length of each branch (log scale) expressed in SNP numbers is indicated.

14 Figure 3 .
Figure 3. Synteny between the genomes of the three strains CFBP 5647, IPO2222 and RNS05.1.2Abelonging to the three D. solani clades.Genomes were manually realigned to D. solani CFBP 5647 sequence to highlight synteny.Synteny analysis was performed using the MAUVE software v2015-02-25.P numbers indicate complete prophages as presented in Table 2.The genomic regions detected by the SiLix analysis in CFBP 5647 are indicated.

Figure 3 .
Figure 3. Synteny between the genomes of the three strains CFBP 5647, IPO2222 and RNS05.1.2Abelonging to the three D. solani clades.Genomes were manually realigned to D. solani CFBP 5647 sequence to highlight synteny.Synteny analysis was performed using the MAUVE software v2015-02-25.P numbers indicate complete prophages as presented in Table 2.The genomic regions detected by the SiLix analysis in CFBP 5647 are indicated.

Figure 4 .
Figure 4. Aggressiveness of D. solani strains isolated from water or tomato on potato tubers and chicory leaves.For potato (A), symptoms were assigned to four classes according to the extent of maceration five days post-inoculation (see Material and Methods).Pictures present examples of the typology of each class.For chicory leaves (B), disease severity was assessed by measuring the length of macerated tissue 24 h post-inoculation.Bars: standard deviation.Both assays were performed in triplicate, and the results were pooled.

Table 1 .
Strains used in this study.

Table 2 .
Complete * prophages detected in reference strains of the three D. solani clades.Prophages are considered complete if they obtained a PHASTER phage completeness score between 100 and 150. *

Table 3 .
Genomic regions specific of each D. solani clade.